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This is a division of application Ser. No. 08/163,091 filed Dec. 7, 1993 now U.S. Pat. 

No. 5,517,641, which was a continuation in part of application Ser. No. 07/889,454 filed 
May 27, 1992 now U.S. Pat. No. 5,408,654. 

INT-CL: [6] G06F 17/30 
US-CL-ISSUED: 707/202; 707/101 
US-CL -CURRENT: 707/202; 707/101 

FIELD-OF- SEARCH : 395/611, 395/612, 395/616, 395/618, 395/617, 395/621 
PRIOR- ART-DISCLOSED : 



U.S. PATENT DOCUMENTS 

j Search Selected 



Search ALL 





PAT -NO 


IS SUE -DATE 


PATENTEE -NAME 


US-CL 




4679139 


July 1987 


Durbin 


395/601 


□ 


4890226 


December 198 9 


Itoh 


395/417 


□ 


5034914 


July 1991 


Osterlund 


395/872 


□ 


5121493 


June 1992 


Ferguson 


395/607 


□ 


5204958 


April 1993 


Cheng 


395/613 


□ 


5222235 


June 1993 


Hintz 


395/612 


□ 


5257362 


October 1993 


Menon 


395/441 


□ 


5274805 


December 1993 


Ferguson 


395/607 



OTHER PUBLICATIONS 

Walker, H., Introduction to Computing and Computer Science with Pascal, pp. 246-247, Jan. 
1986 . 

Hauser et al . , " DB2 2.3 Reorg Tablespace Performance," DB Journal, Aug. 1992, pp. 24-29. 



lof2 



9/27/01 7:3 i AM 



Record Display Form http://westbrs:8820^ga^ 
IBM ,DB2 Utilities Guide, Re^A Chap, pp. 103-114. 

IBM DB2 Command and Utility Preference, pp. 266-272, Reorg Utility. 
Platinum, Put User Guide, pp. 4-1 to 4-10, 1992. 
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PRIMARY- EXAMINER: Black; Thomas G. 

ASSISTANT -EXAMINER: Choules; Jack M. 

ATTY- AGENT- FIRM: Pravel, Hewitt, Kimball & Krieger 

ABSTRACT : 

An improved method to dramatically reduce the time required to reorganize DB2 tablespaces 
and index files by not utilizing conventional sort techniques. Viewing access is allowed 
during the reorganization process by setting the files to read only status. The process 
is basically non-destructive, allowing a prompt return to the original state, and is 
checkpointed, allowing restarting at selected intervals. Briefly, the original table and 
indices are considered as A files and read into memory. New row IDs or RIDs are developed 
using a non- sorting technique so that the proper order of the data is developed. After 
the new RIDs have been developed, both the clustering index and the row data are read out 
of memory and written to a new table and clustering index files in the proper order as B 
files. All of the table files are then stopped to allow exclusive access. Next, a series 
of AMS statements are built to do the renaming operations. Specifically, a series of 
statements are built to rename all of the A files to X files, to rename all B files to A 
files and then to delete all of the X files. Then any remaining non-clustering indices 
are reorganized using non- sorting techniques and renamed in a similar fashion. 
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Abstract 



A system and method for recovering a 
database table (200) that depends on a 
tablespace (206) receives a backup copy (202) 
of the tablespace (206) and reads log records 
(204) associated with the table (200). The 
system then applies the log records (204) to the 
backup copy (202) and builds new table data 
pages from the backup copy (202). Finally, the 
system scans (214) the hew table data pages for records of the first table and 
updates the table from the records. 
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ART-UNIT: 277 

PRIMARY- EXAMINER: Homere; Jean R. 
ATTY- AGENT- FIRM: Baker & McKenzie 



In response to a constraint violation in a row of a database table, an output file is 
generated including the characteristics of the table containing the row in error as well 
as an SQL UPDATE statement for the row. The SQL UPDATE statement includes the column 
values in the row which can be corrected by the user, the user modified SQL UPDATE 
statement being subsequently executed to repair the constraint violation. 

18 Claims, 8 Drawing figures 
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TITLE: Method for repairing constraint violations in a database management system 



BSPR : 

A well known database software program is DATABASE 2 (DB2) database software distributed 
by IBM Corporation. As is known in the art, DB2 operates as a subsystem in a computer 
system operating under the IBM MVS operating system software. In a DB2 environment, user 
data resides in DB2 tables which are in tablespaces . A tablespace is, for example, a 
portion of storage space in a direct access storage device (DASD) such as a disk drive. 
For exemplary purposes, illustrated below is an order_entry table that would be stored 
in a tablespace. The order_entry table contains columns: customer_number ; product_code; 
order_number; buyer_name; and ship__to_zip . 

DEPR : 

FIG 5 illustrates exemplary initialization processing of a CHECK utility according to 
an embodiment of the present invention, such as performed in step 410 of FIG. 4. In step 
510, a user of the CHECK utility provides a name of a tablespace which is to be subject 
to constraint enforcement. For example, the user can input the name of the tablespace 
via an I/O device such as a keyboard to the computer system operating the database 
management system and the CHECK utility. In step 520, the CHECK utility identifies the 
database table located in the tablespace identified by the user, for example by reading 
the DB2 catalog. Usually, there is only one database table in a tablespace. 
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ABPL: 

A database containing datafiles is partitioned into a set of tablespaces . Every disk 
pointer pointing to a data item in a datafile refers to a tablespace-relative file 
number for the datafile. Data pointed to by a tablespace-relative disk pointer is 
retrieved by first checking the cache, and upon a cache miss, the tablespace-relative 
file number is translated into an absolute file number according to a latch- free look up 
technique . 

BSPR: 

In accordance with an aspect of the invention, a method of retrieving a data item from a 
computer database includes partitioning the database into a set of tablespaces and 
storing references to data items as tablespace -relative pointers, indicating a location 
relative to the tablespace containing the data item. A data item is retrieved from any 
one of the datafiles, by reading a tablespace-relative pointer, determining a tablespace 
identity from an operating context, and locating the data item based on the 
tablespace-relative pointer and the tablespace identity. 

BSPR: 

In accordance with another aspect of the invention, a database system comprises a set of 
datafiles and a set of tablespaces forming a partition of and containing the set of 
datafiles. Also contained in the set of datafiles is a set of data items. References to 
the data items are stored in the set of datafiles as tablespace-relative pointers, 
indicating the location of a corresponding data item relative to the tablespace 
containing the corresponding data item. In another embodiment, the database system is 
configured to locate a data item within the set of datafiles by reading a 
tablespace -relative pointer, determining a tablespace identity from an operating 
context, and locating the data item based on the tablespace -relative pointer and the 
tablespace identity. 

BSPR: 

In accordance with a further aspect of the invention, a computer readable medium has a 
sequence of instructions for logically partitioning a set of datafiles of a database 
into a set of tablespaces stored upon it. In another aspect, the computer readable 
medium has instructions for storing references to data items that are contained in the 
set of datafiles as tablespace-relative pointers. Each tablespace-relative pointer 
indicates a location of a corresponding data item relative to the tablespace containing 
the corresponding data item. In a further aspect, the computer readable medium has 
instructions for locating a data item within any one of the datafiles by reading a 
tablespace-relative pointer, determining a tablespace identity from an operating 
context, and locating a data item based on the tablespace-relative pointer and the 
tablespace identity. 

BSPV: 

U.S. patent application Ser. No. 08/852,968 entitled "Pluggable Tablespaces for Database 
Systems," filed by William H. Bridge, Jr., Jonathan D. Klein, J. William Lee, Juan R. 
Loaiza, Alex Tsukerman, Gianfranco Putzolu on May 8, 1997, (now U.S. Pat. No. 5,890,167 
issued Mar. 30, 1999), incorporated herein by reference/ and 

DRPR: 

FIGS. 12(a), 12(b), and 12(c) are flowcharts illustrating the operation of unplugging a 
set of tablespaces from a database according to embodiments of the present invention. 

DRPR: 

FIGS. 13(a), 13(b), and 13(c) are flowcharts illustrating the operation of plugging a 
set of tablespaces into a database according to embodiments of the present invention. 

DEPR: 

To increase the addressing range of disk pointers, groups of related datafiles are 
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collected into tablespaces . ^JPablespace is a collection of onl^Rr more datafiles. 
Tablespaces function as a unit of object placement, space administration, and 
point-in-time recovery. Every datafile within a database belongs to exactly one 
tablespace, and whenever a new datafile is added to a database, it is always added to a 
specific tablespace . A table or an index may be undivided into smaller units, called 
partitions. Each partition is stored in the datafiles of a tablespace. Hence, a table or 
an index may belong to a plurality of tablespaces. 

DEPR: 

For example, database 2 00 can be partitioned into six tablespaces as shown in database 
400 in FIG. 4. Database 400 comprises six tablespaces, 410 to 420. Datafiles 212 and 214 
belong to tablespace 412, and datafiles 216, 218, and 220 belong to tablespace 414. 
System tablespace 410 comprises data dictionary 240. Control file 242 is kept separate, 
not part of a tablespace. 

DEPR: 

When a tablespace is created within a database, it is assigned a tablespace number 
(TSN) , which is unique for that database. The tablespace number is limited to a word 
size, on a 32-bit machine signifying a theoretical limit of about four billion. Each 
tablespace are associated with a control list of datafiles, containing a 
tablespace-relative file number (TRFN) and the corresponding datafile. Thus in FIG , 4, 
tablespaces 410 to 420 each associated with control lists 430 to 440. In particular, 
tablespace 412 is associated with control list 432, which has entries indicating that a 
datafile having TRFN of 1 corresponds to the datafile having an AFN of 2. Likewise, a 
datafile with a TRFN of 2 corresponds to the datafile with an AFN of 3 . Control lists 
430 to 440 are actually kept in control file 242 but are depicted in FIG. 4 near the 
corresponding tablespaces for clarity. 

DEPR: 

A TRFN is unique among the datafiles of a tablespace, but need not be unique among all 
the datafiles of a database . In fact, a datafile may have different 

DEPR: 

The tablespace -relative pointer addressing scheme allows a maximum of 1024 datafiles per 
tablespace, not database, and over four billion tablespaces per database . As a result, 
the allowable number of datafiles is no longer limited by the 10-bit absolute file 
number field in the disk pointer, but by the word size of host computer. In short, the 
theoretical maximum number of datafiles per database is increased to over four billion 
datafiles per database. 

DEPR: 

Tablespace-relative disk pointers also facilitate the transfer of a group of datafiles 
within a tablespace without having to patch the disk pointers. Tablespace-relative file 
numbers need not be unique between tablespaces, only within a tablespace. Therefore, 
plugging a new tablespace into a database does not create any file number conflicts in 
the disk pointers, allowing the transfer to be made without having to patch the disk 
pointers . 

DEPR: 

Translating the TSN : TRFN pair into the proper AFN, however, brings up the issue of 
concurrency. In a concurrent database, more than one user at a time may simultaneously 
update the database, possibly adding or removing datafiles from a tablespace . Generally, 
data structures are protected in a concurrent environment by obtaining a latch for a 
critical section of processing. A latch in this context is a mutually exclusive lock and 
may be implemented in a variety of ways known in the art, including but not limited to 
semaphores . 

DEPR : 

To facilitate the transfer of a group of datafiles from one database to another, groups 
of related datafiles are collected into tablespaces . A tablespace is a collection of one 
or more datafiles. Tablespaces function as a unit of object placement, space 
administration, and point-in- time recovery. Every datafile within a database belongs to 
exactly one tablespace, and whenever a new datafile is added to a database, it is always 
added to a specific tablespace . 

DEPR: 

For example, database 200 can be partitioned into six tablespaces as shown in database 
500 in FIG. 8. Database 500 comprises six tablespaces, 810 to 820. Datafiles 212 and 214 
belong to tablespace 812, and datafiles 216, 218, and 220 belong to tablespace 814. 
System tablespace 810 comprises data dictionary 240 and control file 242. 

DEPR: 

According to an embodiment of the invention, transferring data between two databases has 
two phases. In the first phase, a user "unplugs" a set of tablespaces, containing the 
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desired data, from a source abase . Unplugging a set of tablWyaces is started by- 
issuing an "unplug" command to a database system, which performs in response the steps 
shown in FIG. 12(a) . At step 1200, the database system gets a specification of the 
tablespaces to be transferred, called the "pluggable set." Step 1210 receives the name 
of an export/import file from the user. For example, a user desiring to transfer the 
data in table 250 and index 260 from database 800 specifies tables 812 in the unplugging 
operation. With the pluggable set and the name of the export /import file, the source 
database produces a set of files (steps 1204 and 1206) that the user may then copy to a 
place accessible to the target database. 

DEPR: 

According to one embodiment of the invention, the unplug operation in step 12 06 removes 
the set of tablespaces from the source database ; however, another embodiment of the 
invention leaves the set of tablespaces unchanged in the source database . A preferred 
embodiment of the invention enables both operations. In this situation, the former 
operation is termed "unplugging" a set of tablespaces, and the latter operation, 
"copying" a set of tablespaces. 

DEPR: 

Given a set of unplugged or copied tablespaces, the user may then plug the set of 
tablespaces into a target database by issuing a plug- in command with the name of 
export/import file. The metadata is reconstructed from the pluggable set and the 
plugged- in tablespaces become new tablespaces in the target database. 

DEPR: 

According to another embodiment, the target database accesses the set of tablespaces 
without copying the tablespaces . In step 1310, the target system received the pluggable 
on a computer- readable medium in a drive. After prompting for the name of the 
export/import file (step 1312) and importing the metadata (step 1314) , the target system 
accesses the tablespaces in the pluggable set directly (step 1316) , without patching the 
disk pointers (step 1318) . 

DEPR: 

In a preferred embodiment, both approaches are permitted according to the presence or 
absence of a "read-only" option. If the read-only option is not specified, then the 
tablespaces are copied in; on the other hand, if the read-only option is specified, then 
the tablespaces are used in the drive directly. The read-only option is useful with 
plugging in tablespaces published on a CD-ROM, because the target database will use the 
tablespaces by reading the CD-ROM drive without copying the tablespaces into the target 
database . 

DEPR: 

The two difficulties in the prior art due to the internal structure of databases are 
handled by using tablespace - relative disk pointers to avoid disk pointer patching and by 
exporting/importing only the metadata associated with the transferred set of 
tablespaces . 

DEPR: 

When a tablespace is created within a database, it is assigned a tablespace number 
(TSN) , which is unique for that database . Each tablespace contains a control list of 
datafiles, containing a tablespace-relative file number (TRFN) and the corresponding 
datafile. Thus in FIG. 8, tablespaces 810 to 820 each include control lists 830 to 840. 
In particular, tablespace 812 includes control list 832, which has entries indicating 
that a datafile having TRFN of I corresponds to the datafile having an AFN of 2. 
Likewise, a datafile with a TRFN of 2 corresponds to the datafile with an AFN of 3. 

DEPR: 

A TRFN is unique among the datafiles of a given tablespace, but need not be unique among 
all the datafiles of a database . In the example of FIG. 8, tablespace 812 has a TSN of 2 
and control list 832. Datafile 212 of tablespace 812 has a TRFN of 1 according to 
control list 832, yet datafile 230 in tablespace 520 also has a TRFN of 1 according to 
control list 840. Thus, both datafile 812 and datafile 230 have the same TRFN, but they 
are distinct datafiles. However, the TSNs are different: tablespace 820 has a TSN of 6, 
and tablespace 812 has a TSN of 2. 

DEPR: 

Tablespace-relative disk pointers allow datafile disk pointers to be copied without 
having to be patched. FIG. 9 shows a destination database 900 with two tablespaces, 910 
and 912. Tablespace 912 comprises two datafiles, 312 and 314, and control list 932. 
Datafile 314 is an index, index 360, built on table 350, and contains 

tablespace-relative disk pointer 980 with a tablespace-relative DBA of 1:300, pointing 
to data item 970 in table 350. Copying tablespace 812 of database 800 yields database 
1000 in FIG. 10(a) . Disk pointer 880 maintains the same value, 1:300, but still points 
to data item 270, even in database 1000. During the plugging in process tablespace 812 
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is copied or accessed withol^Ppatching any of its disk pointeflWLn datafile 212 or 214. 
The administrative information in the control list 832 indicating the mapping between 
' the TRFNs for tablespace 812 is reconstituted as described below. Non-dangling disk 
pointer 880 has the identical value in database 1000 as in database 800. However, 
tablespace- relative disk pointer 880 still points to data item 270, because the TSN for 
disk pointer 980 is 1, and control list 832 maintains that a TRFN of 1 indicates 
datafile 212. Thus, tablespace-relative disk pointers avoid the aliasing problem 
associated with the absolute disk pointer technology. 

DEPR: 

Tablespace -relative disk pointers also remain valid after being transferred to a target 
database . FIG. 10(b) illustrates the result, database 1002, of plugging tablespace 820 
of database 800 into database 900. Tablespace -relative disk pointer 882 contains the 
same value of source database 8 00, 1:300. However, disk pointer 8 82 continues to point 
to data item 272, because the tablespace relative file number portion of the disk 
pointer is still valid. 

DEPR: 

Some objects are related to associated with other objects. For example, indexes are 
typically associated with tables. An index that is built on a table contains database 
pointers to that table. As another example, referential integrity constraints can 
associate with several tables. In addition, all partitions of a table are related to 
each other. The database system typically keeps track of these relationships in a data 
dictionary. Therefore, the database system is able to determine whether a set of 
tablespaces , the "pluggable set," contains pointers outside the pluggable set. 

DEPR: 

For example, when tablespace 816 as the only tablespace in a pluggable set is unplugged 
from database 8 00 of FIG. 8, the database system inspects the objects in the tablespace, 
table 254 and index 266. Index 266 is built on table 250, which is found in tablespace 
812, which is not in the pluggable set. Therefore, the database will drop index 266 from 
unplugged tablespace 816. Accordingly, when the pluggable set is plugged into database 
900 the result is shown in FIG. 10(c), where only table 254 is transferred to database 
1004 . 

DEPR: 

In the example, if tablespace 816 is being copies, the database system determines that 
index 266 may contain pointers outside the pluggable set, because it is built on a table 
outside of the pluggable set. When the user is prompted, the user may allow index 266 to 
be dropped from the pluggable set or expand the pluggable set to include tablespace 812, 
which contains the table, table 250, upon which index 266 was built. In either case, the 
result is a self-contained pluggable set. 

DEPR: 

A special case occurs when a data warehouse is periodically refreshed with tablespaces 
from an OLTP database . In this case, the user should drop the tablespaces in the data 
warehouse before plugging a more recent version of the pluggable set from the OLTP 
database into the data warehouse. This procedure avoids a potentially large number of 
external name conflicts. 

DEPR: 

The CD-ROM is distributed to the target database by conventional means (e.g., by mail, 
overnight delivery, file transfer protocol) . At the target site, the CD-ROM pluggable 
set is plugged into a target database, by loading the CD-ROM into a CD-ROM drive of the 
target computer system. After the set of tablespaces is thus available, the metadata 
contained on the CD-ROM is imported into the target database . Since the user data of the 
CD-ROM pluggable set is already in native format, the tablespaces in the pluggable set 
need not be converted or even copied and may remain in the CD-ROM driver, conserving 
disk space of the target site. 
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The present invention discloses a technique for restoring a database in a computer. In 
accordance with the present invention, the database contains objects and is stored on a 
data storage device connected to the computer. After a system failure, a log file is 
read. The log file contains one or more modifications to the database objects. Each 
modification has an associated data page and time stamp or sequence number. The 
modifications are sorted by at least one predefined sorting key value. The sorted 
modifications are then grouped by database object. The sorted modifications are applied 
to each database object in parallel. 

18 Claims, 6 Drawing figures 
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BSPR: 

A common technique for storing a database in a data storage device is to assign each 
table to a tablespace . A tablespace is a named collection of one or more datasets. Each 
tablespace is physically divided into equal units called data pages, and each data page 
contains one or more tuples of data . 
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